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COMPOSITIONS AND METHODS FOR 
TARGETED GENE INSERTION 

This application claims priority to U.S. 
Provisional Application No. 50/138,968, Filed June 8, 
1999, the entirety of which is incorporated by reference 
herein. 

5 

FIELD OF THE INVENTION 

This invention relates to the field of 
molecular biology and manipulation of the eucaryotic 
genome. In particular, the invention provides a novel 
10 system and DNA constructs for integrating heterologous 
DNA segments at selected locations in target genomes. 



BACKGROUND OF THE INVENTION 

Various scientific articles are referred to in 
15 parentheses throughout the specification, and complete 
citations are listed at the end of the specification. 
These articles are incorporated by reference herein to 
describe the state of the art to which this invention 
pertains . 

20 The ability to create a null mutation in a 

specific gene can provide an unambiguous test of the 
functional role of its gene product in an organism. 
Creating a null mutation has obvious advantages over 
approaches utilizing antisense transcripts since null 

25 mutations do not present problems such as incomplete 
suppression of the target gene product and unknown 
specificity of its effects. In addition, the dominant 
nature of the antisense approach for gene suppression is 
one major drawback. Thus, if the gene of interest is 

30 critical for survival or fitness of the organism, one may 
inadvertently select againsc transf ormants that have 
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effectively "shut down" the expression of the target 
gene. Alternatively, one may select for spontaneous 
second site mutations that compensate for the defect 
caused by the gene suppression, In contrast, the 
5 recessive nature of targeted gene insertion via 

homologous recombination should avoid these concerns. 
The first targeted progeny should be in the heterozygous 
state and in most cases, a wild-type phenotype would be 
expected. The phenotype (s) caused by the loss of the 

10 targeted gene can be studied in the homozygous progenies 
of subsequent generations. In this way, even 
housekeeping genes that are essential can be studied as 
embryo lethals. 

Another type of reverse genetics approach is 

15 the so-called "gene machine" screens, in which a large 
collection of random T-DNA or transposon integration 
events are screened by PCR to identify insertions in or 
near the locus of interest (1, 4, 6) . Although this 
technique has been successful in the identification of 

2 0 insertion mutations for genes of interest, a routine gene 
targeting approach should be more versatile in the 
directed mutagenesis of specific genes. For example, 
approaches such as gene swapping, the so-called "knock- 
in" mutation, or any other precise alterations at the 

25 locus of interest are not possible with the "gene 
machine" approach. 

Although gene targeting has become a well- 
defined technique in mouse research (2) , the specific 
disruption of a non-selectable locus in higher plants has 

30. not been reported until recently. Most of the earlier 
work on gene targeting in higher plants involved the 
'repair/mutation of a selectable marker gene (8-10) . The 
observed frequencies of recombination using that method 
were invariably low (9) . 

35 For ectopic expression studies in which a 
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desired gene product is to be produced, the specific 
targeting of the transgene to a preselected locus should 
minimize variations in transgene activity due to position 
effects and/or cosuppression . Since neither one of these 
5 phenomena is well understood at the mechanistic level, 
the currently available strategy to obtain the desired 
expression levels and pattern is to screen a large number 
of independent transformed lines. This can be laborious 
and time-consuming. Insertion of the desired transgene 

10 into a preselected locus of the genome would avoid these 
technical problems and help streamline the process of 
engineering a desired trait in the plant of interest. 

For commercial purposes, specific manipulation 
of the genome through gene targeting should also greatly 

15 facilitate the process of registration for the transgenic 
materials. At the present time, clinical trials to 
demonstrate the safety of transgenic crops are needed for 
any new transgenic plant materials to be released as 
commercial products. One of the major reasons for this 

20 requirement is to safeguard against production of 

allergens or toxins in new transgenic lines that may 
result from inadvertent mutations due to random insertion 
events. Targeted gene insertion would obviate this 
concern and thus result in substantial savings of time 

25 and resources for this phase of product 
commercialization. 

One approach to targeting employed a general 
gene- targeting construct using a kanamycin resistance 
gene (Nptll) as a positive selection marker (7) . 

3 0 Polylinkers were designed in the flanking regions of this 
marker to facilitate the cloning of genomic fragments 
from the target of "interest. To facilitate detection of 
illegitimate insertion events, a GUS expression unit, 
which is a screenable marker, was inserted outside of the 

35 homologous regions. In the event of a double cross-over 
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targeting event, the resultant plant cell is kan r GUS". 
The feasibility of this vector was demonstrated by 
targeting of the Nptll marker into the TGA3 locus, which 
encodes a transcription factor of Arabidopsis (7) . The 
5 absence of a negative selection marker enabled a direct 
estimate of the relative frequency of the targeting event 
to that of random insertion. In one set of experiments, 
this number was in the range of 1 to 2 targeting events 
per 2,58 0 transf ormants . This methodology was used to 

10 create the first "knock-out 11 Arabidopsis plant (3) . In 

that work, a targeted event at the AGL5 locus (which also 
encodes a transcription factor) was recovered in 1 of 
about 75 0 transf ormants (3) . 

Although successful targeted gene insertion was 

15 achieved using the strategy outlined above, that strategy 
remains limited in several important respects. First, a 
large number of independent transformation events are 
needed in order to obtain the infrequent homologous 
recombination events rather than the more frequent random 

2 0 integration events. This limitation precludes use of the 
current methodology in most species of agronomic 
interest. Second, the currently available method has no 
means for limiting the number of complex integration 
events that could occur because the number of 

2 5 recombination substrates is greater than one per cell; 
nor does it allow for application of negative selection 
strategies which could expedite the process of detecting 
the rare targeted integration events. 

Despite the obvious value of targeting 

30 mutations to specific, selected locations within the 
genome, sometimes it is yet desirable to screen for 
particular phenotypes among a population of random 
mutations. The method of activation tagging (14, 15) for 
creating random mutants has proven valuable- The method 

35 involves tagging genes at random by the insertion of DNA 
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constructs comprising a selection marker gene and 
transcriptional promoters which are able to transactivate 
the expression of genes in the vicinity of the insertion. 
The result is the generation of dominant mutants which 
5 survive in the presence of the selection agent and which 
have been useful in studying genetic influences on plant 
growth substances, polyamine metabolism, signal 
transduction by cytokines and abscisic acid, for example. 
The method retains the disadvantage of requiring large 
10 numbers of transf ormants which precludes application to 
many agronomically important plants. 

SUMMARY OP THE INVENTION 

The present invention provides a new process 
15 and new DNA constructs and vectors for targeted 

manipulation of eucaryotic genomes. One key feature of 
this novel approach for gene targeting is the generation 
of recombinant substrates through the deployment of 
transposable elements. In a preferred embodiment, it 
20 provides a key advantage by minimizing the number of 

recombination substrates to one per cell. Among other 
advantages of the approach, it solves two general 
problems associated with currently available methods. 
First, it minimizes the number of independent 

2 5 transformation events required to obtain the infrequent 

homologous recombination events rather than the more 
frequent random integration events. This enables the 
application of gene targeting technologies to more 
organisms of interest. Second, the novel deployment of 

3 0 powerful negative selection strategies streamlines the 

recovery of low- frequency homologous recombination events 
by suppressing or eliminating complex integration 
processes . 

According to one aspect of the invention, a 
35 general DNA construct for producing a gene targeting 
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construct is provided. The DNA construct is bounded by 
termini within which are a pair of DNA substrates for a 
selected transposase. These DNA substrates contain 
between them the following elements: (1) a first cloning 
5 site and a second cloning site for insertion of one or 
more additional DNA segments, wherein the first cloning 
site and the second cloning site have disposed between 
them a positive selection gene encoding a gene product 
that confers to the cells a selectable phenotype 

10 comprising resistance to a positive selection agent that 
is deleterious or lethal to cells having genomes in which 
the DNA construct has not integrated; and (2) a negative 
selection gene disposed between one of the DNA substrates 
for the selected transposase and either the first cloning 

15 site or the second cloning site, but not between the 
first cloning site and the second cloning site, the 
negative selection gene conferring to the cells a 
selectable phenotype comprising susceptibility to a 
negative selection agent, to which cells having genomes 

2 0 in which the DNA construct has not integrated are not 
susceptible. Optionally, the DNA construct contains a 
detectable marker gene encoding a detectable gene 
product. The detectable marker gene is operably inserted 
in the DNA construct relative to one of the DNA 

25 substrates for the selected transposase such that, upon 

excision of the DNA construct from a genome by the action 
of the transposase, the detectable gene product is no 
longer detectable. Preferably, the detectable marker 
gene is inserted in the construct such that one of the 

30 DNA substrates for the selected transposase is located 
within the gene, between its promoter and coding 
sequence . 

The DNA construct described above can be used 
for random insertion of a gene of interest into a genome. 
35 In accordance with a significant aspect of the invention, 
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however, it is adapted for integrating a heterologous DNA 
segment at a pre -determined location of a genome. The 
adaptation comprises inserting a first targeting segment 
in the first cloning site and a second targeting segment 
5. in the second cloning site. Each targeting segment 
comprises a DNA sequence substantially homologous to 
sequences in the genome comprising or flanking the pre- 
determined location. The presence of the targeting 
segments enables the DNA construct to integrate into the 

10 genome at the pre-determined location by homologous 
recombinat ion . 

In a preferred embodiment, the above -described 
DNA constructs are operably inserted into a vector for 
transforming cells. Preferably, for the transformation 

15 of plant cells, the vector is an Agrobacteriuin vector. 

According to. another aspect of the invention, 
the following method for inserting a heterologous DNA 
molecule into a pre-determined location on a plant genome 
is provided, utilizing the above -described AgroJbacterium 

20 vector. 

Step 1. Cells are transformed with the vector. 
The DNA construct can integrate into the genome randomly 
(more frequent) or by homologous recombination with the 
targeted genomic DNA sequence (less frequent) . At this 

25 stage, transf ormants with random insertions are selected 
based on their resistance to the positive selection agent 
and sensitivity to the negative selection agent, and 
(optionally) their expression of the detectable gene 
product. In a preferred embodiment, transf ormants with a 

30 single copy of the transf crming DNA are selected. Cells 
transformed with this DNA construct are referred to as 
''substrate-transformed" cells . 

Step 2 . Homozygous transgenic plants 
containing the transforming DNA are regenerated from the 

35 selected substrate-transformed cells. These plants are 
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crossed with a line that expresses the transposase 
specific for the DNA substrates engineered into the 
vector (if created recombinantly , then referred to as a 
"transposase- transformed" line), to produce heterozygous 
5 Fl progeny. The progeny contain the transposase and the 
transforming DNA construct harboring the transposase 
recognition sites. Excision and integration events occur 
naturally in these hybrid plants as they grow, due to the 
presence of the transposase and its substrate. Since the 

10 Fl hybrids are heterozygous, excision at the DNA 

substrate sites on the construct occurs which releases a 
portion of the transforming DNA, referred to as the 
"recombination substrate" . In a preferred embodiment the 
excision will generate, per cell, a single copy of the 

15 "recombination substrate" . The recombination substrate 
contains the targeting segment with the positive 
selection gene, as well as the negative selection gene 
located outside the targeting segment. 

Step 3. The Fl plants are allowed to self- 

20 pollinate to produce F2 seed. Optionally, this step may 
be carried forward into the F3 and subsequent 
generations. Seeds are germinated on growth media 
containing the positive and negative selection agents. 
Random integration of the recombination substrate into 

25 the genome results in plants that are sensitive to the 
negative selection agent and resistant to the positive 
selection agent. However, integration of the excised 
insert by double crossover events at the targeted locus 
results in plants that are resistant both to the negative 

30 selection agent and to the positive selection agent. 

These plants may be selected and/or maintained by their 
ability to survive in the presence of both selection 
agents, while plants containing random integrants cannot 
survive on the negative selection agent. 

3 5 According to another aspect of the invention, 
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the aforementioned DNA constructs and methods may be 
adapted to perform activation tagging of a plant genome 
to create variants displaying a desired phenotype . In 
this case, the selection step for the progeny omits the 
5 negative selection. Instead, the plants are screened for 
the phenotype desired to be identified by the activation 
tagging method. 

According to another aspect of the invention, 
kits are provided to facilitate performance of the 
10 targeted gene insertion or activation tagging methods 

described above. The kits provide one or more of the DNA 
constructs of the invention, along with instructions for 
their use in performing the methods. 

15 Other features and advantages of the present 

invention will be better understood by reference to the 
drawings, detailed description and examples that follow. 

BRIEF DESCRIPTION OF THE DRAWINGS 

20 Figure 1. Generalized AgroJbacterium-based gene 

targeting construct. RB, Right border; LB, Left border; 
PLS1, polylinker sequence 1; PLS2 , polylinker sequence 2; 
CodA, cytosine deaminase -encoding sequence; Bar, 
phosphinothricin acetyltransf erase -encoding sequence; 

25 GUS, |3-glucuronidase-encoding sequence; 35S, CaMV 35S 
promoter; nos, 3 1 polyadenylat ion sequence from the 
nopal ine synthase gene; Ds r excision target (DNA 
substrate) for the Ac-dependent transposase. 

Figure 2. Strategy for Ac-dependent production 

3 0 of recombination substrates in planta for gene targeting. 
The Arabidopsis gene TGA3 is used as an example to 
illustrate the use of Ac-dependent excision to generate 
substrates for recombination. The hatched regions 
designate genome sequences flanking TGA3 and are cloned 
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into the polylinker sites (PLS1, PLS2) of the targeting 
construct as shown in Figure 1. Homozygous transgenic 
plants containing this insert are crossed with a line 
that is expressing the Ac transposase ( 3 5S :Ac/Np tTJ) . 
5 Excision at the Ds sites of the transgene releases a 

single molecule of the recombination substrate. Random 
reintegration of the excised insert produces plants that 
are 5-Fc s , PPT r , and GUS' while the parental line is 5-Fc s , 
PPT r , and GUS + . However, integration of the excised 

10 insert via double cross-over events at the TGA3 locus 
will result in plants that are 5-Fc r , PPT r , and GUS' . 
These can be confirmed by performing PCR with genomic DNA 
using the diagnostic primers PI and P2, as indicated. 
The heterozygous nature of the primary targeted 

15 transformant is illustrated on the bottom by showing the 
wild-type TGA3 allele along with the targeted allele. 

DETAILED DESCRIPTION OF THE INVENTION 
I . Definitions 

20 Various terms relating to the biological 

molecules and other aspects of the present invention are 
used throughout the specification and claims. 

With reference to the mutations of the 
invention, the term "null mutant" is used to designate an 

25 alteration in the genomic DNA sequence of an organism 

that can cause the product or function of the gene to be 
largely absent or nonfunctional. Such alterations may 
occur in coding and/or noncoding regions of the gene, 
including regulatory regions or other regions which when 

3 0 altered cause said product or function to be largely 

absent or nonfunctional. The alterations may include 
insertions and/or deletions of one or more base pairs 
and/or changes in one or more base pairs. 

In reference to the strategic placement of 

3 5 heterologous DNA segments within the genomic DNA, the 
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term "targeted gene insertion" is used to designate the 
designed, engineered, creative and/or logical selection 
of specific genomic DNA sequence (s) of interest for 
insertion, deletion or substitution of one or more base 
5 pairs of DNA. This DNA may encode a detectable or 
selectable gene product or function to facilitate 
identifying and/or isolating successful "targeted gene 
insertion" events. "Targeting" is accomplished by. 
placing "targeting DNA sequences" having homology with 

10 the known, determined or predicted DNA sequence (s) of the 
genomic DNA of interest into the constructs of the 
invention in a manner such that homologous recombination 
may occur. "Targeted gene insertion" may typically 
create a null mutant, but may also create an up- regulated 

15 or down- regulated gene, or may have no ascertainable 

effect on the genomic DNA so altered. The "targeted DNA" 
or "targeted genomic DNA" is the genomic sequence of 
interest from the organism to be transformed. 

In reference to placement of exogenous DNA 

2 0 within the genomic DNA of an organism in locations other 

than those determined by strategic placemen- or designed, 
engineered, creative or logical selection, the terms 
"random insertion" and/or "random integration" are used. 
This DNA also may encode a detectable or selectable gene 
25 product or function to facilitate identifying and/or 

isolating "random insertion" events. "Random insertion" 
may occur by homologous or heterologous recombination 
with genomic DNA sequences. "Random insertion" may 
create a null mutant, an up-regulated or down-regulated 

3 0 gene, or may have no ascertainable effect on the genomic 

DNA so altered. 

With reference to nucleic acid -molecules, the 
term "isolated nucleic acid" is sometimes used. This 
term, when applied to DNA, refers to a DNA molecule that 
35 is separated from sequences with which it is immediately 
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contiguous (in the 5' and 3' directions) in the naturally- 
occurring genome of the organism from which it was 
derived. For example, the "isolated nucleic acid" may- 
comprise a DNA molecule inserted into a vector, such as a 
5 plasmid or virus vector, or integrated into the genomic • 
DNA of a procaryote or eucaryote. An "isolated nucleic 
acid molecule" may also comprise a cDNA molecule. 

The terms "recombinant substrate" or 
"recombination substrates'' refer to the DNA molecules 

10 which are produced in Fl progeny produced by the method 
of this invention. The recombination substrate contains 
the targeting sequence with the positive selection gene, 
as well as the negative selection gene located outside 
the targeting sequence. The recombinant substrates 

15 result from the excision by the transposase activity 

which specifically recognizes the DNA substrates in the 
DNA constructs of this invention. The recombinant 
substrates, when integrated by homologous recombination 
in the F2 progeny, result in organisms which have an 

2 0 insertion in the targeted gene and which are selected by 
their resistance to both the positive and negative 
selection agents. 

The terms "DNA substrate" and "excision site" 
are use in reference to the specific sequences or 

25 locations within the DNA molecules at which the 

transposase enzyme activity of a transposable element 
system can excise flanking DNA sequences. These DNA 
substrates are also referred to herein as ^transposase 
recognition sites" . Each transposase activity has 

30 specificity for its own DNA substrate sequence in a 

manner that is integral to that particular transposable 
e 1 ement sy s t em . 

The term "substantially pure" refers to a 
preparation comprising at least 50-60% by weight the 

35 compound of interest (e.g., nucleic acid, 



WO 00/75289 PCT/US00/1 5783 

- 13 - 

oligonucleotide, protein, etc.). More preferably, the 
preparation comprises at least 75% by weight, and most 
preferably 90-99% by weight, the compound of interest. 
Purity is measured by methods appropriate for the 
5 compound of interest (e.g. chromatographic methods, 
agarose or polyacryl amide gel electrophoresis, HPLC 
analysis, and the like) . 

Nucleic acid sequences and amino acid sequences 
can be compared using computer programs that align the 

10 similar sequences of the nucleic or amino acids thus 

define the differences. In preferred methodologies, the 
BLAST programs (NCBI) and parameters used therein are 
employed, and the DNAstar system (Madison, WI) is used to 
align sequence fragments of genomic DNA sequences. 

15 However, equivalent alignments and similarity/ identity 
assessments can be obtained through the use of any 
standard alignment software. For instance, the GCG 
Wisconsin Package version 9.1, available from the 
Genetics Computer Group in Madison, Wisconsin, and the 

20 default parameters used (gap creation penalty=12, gap 

extension penalty=4) by that program may also be used to 
compare sequence identity and similarity. 

The terms "percent identical" and "percent 
similar" are also used herein. When referring to nucleic 

25 acid molecules, u percent identical" refers to the percent 
of the nucleotides of the subject nucleic acid sequence 
that have been matched to identical nucleotides by a 
sequence analysis program. 

With respect to single- stranded nucleic acid 

30 molecules, the term "specifically hybridizing" refers to 
the association between two single-stranded nucleic acid 
molecules of sufficiently complementary sequence to 
permit such hybridization under pre -determined conditions 
generally used in the art (sometimes termed 

35 "substantially complementary") . In particular, the term 
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refers to hybridization of an oligonucleotide with a 
substantially complementary sequence contained within a 
single-stranded DNA or RNA molecule, to the substantial 
exclusion of hybridization of the oligonucleotide with 
5 single-stranded nucleic acids of non- complementary 
sequence . 

A "coding sequence" or "coding region" refers 
to a nucleic acid molecule having sequence information 
necessary to produce a gene product, when the sequence is 
10 expressed. 

The term "operably linked" or "operably 
inserted" means that the regulatory sequences necessary 
for expression of the coding sequence are placed in a 
nucleic acid molecule in the appropriate positions 

15 relative to the coding sequence so as to enable 

expression of the coding sequence. This same definition 
is sometimes applied to the arrangement other 
transcription control elements (e.g. enhancers) in an 
expression vector. 

20 Transcriptional and translational control 

sequences are DNA regulatory sequences, such as 
promoters, enhancers, polyadenylation signals, 
terminators, and the like, that provide for the 
expression of a coding sequence in a host cell. 

25 The terms "promoter" , "promoter region" or 

"promoter sequence" refer generally to transcriptional 
regulatory regions of a gene, which may be found at the 
5' or 3 1 side of the coding region, or within the coding 
region, or within introns. Typically, a promoter is a 

30 DNA regulatory region capable of binding RNA polymerase 
in a cell and initiating transcription of a downstream 
(3' direction) coding sequence. The typical 5 f promoter 
sequence is bounded at its 3 1 terminus by the 
transcription initiation site and extends upstream (5 1 

35 direction) to include the minimum number of bases or 
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elements necessary to initiate transcription at levels 
detectable above background. Within the promoter 
sequence is a transcription initiation site (conveniently 
defined by mapping with nuclease SI) , as well as protein 
5 binding domains (consensus sequences) responsible for the 
binding of RNA polymerase. 

A "vector" is a replicon, such as plasmid, 
phage, cosmid, or virus to which another nucleic acid 
segment may be operably inserted so as to bring about the 

10 replication or expression of the segment. 

The term "nucleic acid construct" or "DNA 
construct" is sometimes used to refer to a coding 
sequence or sequences operably linked to appropriate 
regulatory sequences and inserted into a vector for 

15 transf orming a cell. This term may be used 

interchangeably with the term "transf orming DNA" . Such a 
nucleic acid construct may contain a coding sequence for 
a gene product of interest, along with a selectable 
marker gene and/or a reporter gene. 

2 0 The terms "selectable marker gene" or 

"selection marker gene" refer to a gene encoding a 
product that, when expressed, confers a selectable 
phenotype on a transformed cell. "Positive selection 
marker" refers to a gene whose functioning gene product, 

25 when expressed, confers upon a cell the phenotype of 
survival or growth in the presence of a positive 
selection agent which is deleterious or lethal to cells 
which do not possess the "positive selection marker" . 
"Negative selection marker" refers to a gene whose 

30 functioning gene product, when expressed, confers upon a 
cell the phenotype of susceptibility to the presence of a 
negative selection agent to which cells which do not 
possess the "negative selection marker" are not 
susceptible . 

35 The term "reporter gene" . or "detectable marker 
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gene" refers to a gene that encodes a product which is 
easily detectable by standard methods, either directly or 
indirectly. 

A "heterologous" region of a nucleic acid 
5 construct is an identifiable segment (or segments) of the 
nucleic acid molecule within a larger molecule that is 
not found in association with the larger molecule in 
nature. Thus, when the heterologous region encodes a 
mammalian gene, the gene will usually be flanked by DNA 

10 that does not flank the mammalian genomic DNA in the 
genome of the source organism. In another example, a 
heterologous region is a construct where the coding 
sequence itself is not found in nature (e.g., a cDNA 
where the genomic coding sequence contains introns, or 

15 synthetic sequences having codons different than the 

native gene) . Allelic variations or naturally-occurring 
mutational events do not give rise to a heterologous 
. region of DNA as defined herein. The term "DNA 
construct", as defined above, is also used to refer to a 

2 0 heterologous region, particularly one constructed for use 

in transformation of a cell. 

A cell has been "transformed" or " transf ected" 
by exogenous or heterologous DNA when such DNA has been 
introduced inside the cell. The transforming DNA may or 
25 may not be integrated (covalently linked) into the genome 
of the cell. In prokaryotes, yeast, and mammalian cells 
for example, the transforming DNA may be maintained on an 
episomal element such as a plasmid. With respect to 
eukaryotic cells, a stably transformed cell is one in 

3 0 . which the transforming DNA has become integrated into a 

chromosome so that it is inherited by daughter cells 
through chromosome replication. This stability is 
demonstrated by the ability of the eukaryotic cell to 
establish cell lines or clones comprised of a population 
3 5 of daughter cells containing the transforming DNA. A 
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"clone" is a population of cells derived from a single 
cell or common ancestor by mitosis. A "cell line" is a 
clone of a primary cell that is capable of stable growth 
in vitro for many generations. 
5 Other definitions may be found in the 

description that follows. 



11 - Description 

To practice the novel gene targeting strategy 
10 of the present invention, a DNA construct for 

transforming cells is needed that has, combined in novel 
fashion, the following elements: (1) targeting segments 
that comprise extended regions of homology with the 
targeted location on the genome; (2) a positive selection 
15 gene contained between appropriate targeting segments; 
(3) a negative selection gene located outside the 
targeting segments; and (4) a pair of DNA substrates for 
a selected transposase, located outside the targeting 
segments and the negative selection gene. Optionally, 

2 0 the region between the targeting segments may also 

contain one or more cloning sites for insertion of 
additional nucleotide sequences. In addition, the 
transforming DNA construct optionally may contain a 
reporter gene which, if present, preferably has disposed 
25 therewithin one of the transposase recognition sites, 
such that upon excision, the activity of the reporter 
gene product is not detectable. In a preferred 
embodiment, the DNA substrate is located between the 
promoter sequence and the gene sequence encoding the 

3 0 reporter gene product, such that upon excision and 

reintegration, the intact promoter remains nears one of 
the ends of the integrated DNA. In another preferred 
embodiment, the DNA substrate is short (<1.5 Kb) such 
that it still retains the specific recognition sites for 
35 the transposase, but does not interfere with the ability 
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of the promoter to drive expression of the reporter gene 
activity. 

The targeting segment is a DNA sequence that 
has homology with a selected region of the genome being 
5 transformed, and which is of sufficient length and 

homology to ensure the homologous recombination event 
necessary for incorporation of the transforming DNA into 
the genome. The targeting segment may comprise regions 
of homology that encompass cr flank the selected target 

10 region of the genome. The targeting segments are 

preferably greater than 500 bases on either side of the 
positive selection gene and optional additional 
nucleotide segments. More preferably, they are greater 
than about 1 kb to 1.5 kb on either side of the positive 

15 selection gene, and most preferably they are at least 2-3 
kb on either side of the positive selection gene. One 
skilled in the art will be able to determine the required 
length of the targeting sequences by considering the 
relative factors of length and relative homology with the 

20 known or anticipated sequence of the targeted genomic 
sequence, the critical factor being that the selected 
targeting sequence allow for the low frequency event of 
homologous recombination. For example, it would be 
appreciated by one skilled in the art that the targeting 

2 5 sequence could be shorter in cases where there is a high 

degree of similarity or identity with the targeted 
genomic sequence, or that the targeting sequence might be 
longer in the case of low similarity or where the 
sequence of targeted genomic DNA is not fully known. 

3 0 The targeting segments can be. selected for 

homologous recombination with any portion of a genome of 
interest. Preferably, however, genome targets comprising 
genes or regulatory regions are selected. Alternatively, 
regions adjacent to or near genes may be selected, such 
35 that insertions may be made without disrupting gene 
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expression. 

The positive selection gene may comprise one of 
many such genes known and used in the art . Useful 
selectable marker systems include, but are not limited 
5 to: genes that confer antibiotic resistances (e.g., 
resistance to kanamycin, hygromycin or bialaphos) or 
herbicide resistance (e.g., resistance to sulfonylurea, 
phosphinothricin, or glyphosate) . In the preferred 
embodiment taught in Example 1, the Bar gene, which 

10 confers resistance to herbicides that are baaed on 
phosphinothricin (PPT), is utilized. 

The negative selection gene also may be one of 
several such genes known in the art. Preferred. for use 
in the invention is the CodA gene, encoding cytosine 

15 deaminase. This enzyme converts the innocuous 5- 

f luorocytoeine to the cytotoxic 5-f luorouracil . Other 
negative selections that can be used in the invention 
include, but are not limited to, the aux-2 gene from the 
Ti-plasmid of Agrrobacterium, the TK gene from SV4 0, 

20 cytochrome f450 from Strep tomyces griseolus, the Adh gene 
from maize or Arabidopsis, or any gene encoding an enzyme 
capable of converting innocuous substances into harmful 
or lethal substances. 

The strategies of the present invention can be 

2 5 used in any system known now or discovered in the future 

to harbor transposable element systems. The present 
invention exemplifies gene targeting in plants using the 
well -characterized Ac/Ds system. Other plant 
transposable element systems suitable for use in the 

3 0 present invention include, but are not limited to: 

Spm(En) I dSpm from maize, Dt/rdt from maize, Mu-MI/Mn from 
maize, and Taml/Tam2 or Tam3/Tam4 from snapdragon. The P 
element from Drosophila melanogaster is also suitable for 
use in the present invention. Persons skilled in the art 
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will appreciate that numerous other organisms, including 
Drosophila, yeast, the nematode, C. elegans, and mammals 
such as mice, contain characterized transposable element 
systems, each of which has potential for use in the 
5 present invention. 

The optional detectable marker gene may be 
selected from any of the numerous genes known and used in 
the art for this purpose. Examples of suitable 
detectable marker genes include, but are not limited to, 

10 genes encoding: 0 -glucuronidase (GUS) , p-galactosidase, 
chloramphenicol acetyl transferase (CAT) , various 
transcription factors, alcohol dehydrogenase and 
lucif erase. In the preferred embodiment taught in 
Example 1, the GUS marker is utilized. 

15 The above -described DNA constructs may be used 

directly or as part of a vector, in accordance with the 
wide variety of transformation methods available to 
persons of skill in the art. 

In one preferred embodiment, the gene targeting 

20 strategy of the invention is applied to plants. 

Transgenic plants can be generated using standard plant 
transformation methods known to those skilled in the art. 
These include, but are not limited to, biolistic DNA 
delivery (i.e., particle bombardment), Agrobacteri u/n 

2 5 vectors, PEG treatment of protoplasts, UV laser 

microbeam, gemini virus vectors, calcium phosphate 
treatment of protoplasts, elect roporat ion of isolated 
protoplasts, agitation of cell suspensions with 
microbeads coated with the transforming DNA, direct DNA 

30 uptake, liposome-mediated DNA uptake, and the like. Such 
methods have been published in the art. See, e.g., 
Methods for Plant Molecular Biology (Weissbach & 
Weissbach, eds., 1988); Methods in Plant Molecular 
Biology (Schuler & Zielinski, eds., 1989); Plant 

35 Molecular Biology Manual (Gelvin, Schilperoort , Verma, 
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eds., 1993); and Methods in Plant Molecular Bioloav - A 
Laboratory Manual (Maliga, Klessig, Cashmore, Gruissem & 
Varner, eds. , 1994) . 

The method of transformation depends upon the 
5 plant to be transformed. The biolistic DNA delivery 
method is useful for nuclear transformation of 
monocotyledenous plants, such as maize. Alternatively, 
Agrobacterium vectors, particularly superbinary vectors 
such as described by Ishida et al . (Nature Biotechnology 
10 14:745-750, 1996) are used for transformation of plant 
nuclei . 

In another embodiment, the DNA constructs of 
the invention are used for activation tagging of plants. 
After transformation using standard plant transformation 

15 methods known to those skilled in the art, selection with 
the positive selection agent, but not the negative 
selective agent will allow the higher frequency random 
integration events to be recovered as well as other 
transformants . These transf ormants will also have 

20 detectable amounts of reporter .gene product activity. 
Transgenic plants are then regenerated from these 
transformants and these plants are then crossed with 
lines expressing the transposase activity corresponding 
to the DNA substrates of the transforming DNA vectors. 

25 The active transposition of the integrated DNA will 
result in progeny with insertions in many different 
locations throughout the genomic DNA. These progeny will 
often contain genes which are being overexpressed due to 
transactivation by a promoter in the integrated DNA which 

3 0 transactivates downstream gene expression. This 

population of transformed and transposed mutants or its 
progeny can then be screened for useful phenotypes. For 
plants of agronomic interest, this method of activation 
tagging has substantial advantages over current methods 

35 in that only small numbers of independent transformants 
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are required, yet the novel application of the 
transposable element system enables the possibility of a 
large population of potential mutations. In a preferred 
embodiment of this method, the DNA substrate of the DNA 
5 construct is inserted in the reporter gene between the 
promoter sequence and the encoding sequence such that 
upon excision, the promoter will be close to one end of 
the excised and reintegrated DNA, and the detectable 
activity of the reporter gene will be largely absent or 

10 nondetectable . In a still more preferred embodiment of 
this method, the DNA substrate used is a short (<1.5 kb) 
Ds element from maize. Without intending to limit the 
invention in any way by explanation, presumably longer Ds 
elements contain transcription termination signals which 

15 would interfere with the expression of both the reporter 
gene as well as any transact ivated genes. In a highly 
preferred embodiment, the multiple copies of a promoter 
are used or promoter (s) with inducible activity or 
tissue- specific activity or other such promoters as would 

20 be known to one skilled in the art to be useful. In 

another preferred embodiment, seed can be collected from 
the transformed and transposed population or its progeny 
to be used for screening for useful phenotypes . 



2 5 The following example is provided to describe 

the invention in greater detail. It is intended to 
illustrate, not to limit, the invention. 



EXAMPLE 1 

30 Transposon-Based Gene Targeting Strategy 

for Plants , Using Aarobacterium Vectors 

This example describes new DNA targeting 
constructs to facilitate the transfer of gene targeting 
35 technology from Arabidopsis to crop plants such as maize. 
The new construct design comprises a more general 
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positive selection marker than used in current systems, 
as well as a substrate-dependent negative selection 
marker to streamline the detection of the desired event. 
The major goal in this new gene targeting strategy is to 
5 avoid the need to directly generate large numbers of 
independent transformation events via Agrobacterium 
tumefaciens. An alternative strategy utilizing the maize 
Ac/Ds transposon system is therefore employed to 
ascertain its efficacy of generating the substrates for 
10 homologous recombination in planta. 

Objectives 

1) General positive selection marker for plant 
transformation. In the inventor's earlier generation of 

15 gene targeting construct, the Nptll gene was used as the 
positive selection marker, providing resistance to the 
antibiotic, kanamycin. Although this works well in 
Arabidopsis and many dicotyledenous plants, it is not 
efficient for selection in many monocotyledenous plants 

20 that have been tested. The Bar gene, which confers 
resistance to herbicides that are based on 
phosphincthricin (PPT) , has been tested as an alternative 
marker. In addition to being an established positive 
selection marker for dicots as well as monocots, the Bar 

25 gene can also provide selection on soil-grown plants by 
herbicide spraying. Thus, it is a more versatile 
positive selection marker. This example describes the 
construction of a new generation of gene targeting 
vectors based on the Bar gene. 

30 2) Incorporation of negative selection markers. 

In order to streamline the efforts in identifying the 
desired recombination event, a substrate -dependent 
negative selection marker, cytosine deaminase (CodA) , is 
used. When driven by the CaMV 3 5S promoter, the CodA 
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gene provides negative selection during seed germination 
and early seedling growth in the presence of 5- 
f luorocytosine (5-Fc) [11, 13] . This construct was 
tested in Arabidopsis and was found to provide good 
5 negative selection on agar plates supplemented with 5-Fc 
(Figure 4) , as reported earlier by other researchers. 
Incorporation of the CodA expression cassette into the 
targeting construct is intended to help to minimize the 
number of random insertion events. 

10 3) Application of a transposon-based gene 

targeting strategy- Although, in principle, the 
inclusion of a negative selection marker should simplify 
the gene targeting approach, it may not be compatible 
with the preferred Agrobacteri um-mediated transformation 

15 method. With Agrobacteri urn- based plant transformation 
strategies, multiple copies of the T-DNA are often 
inserted in the genome of transformed plant cells. In 
this case, the targeted event may coexist with random 
insertions and is removed when negative selection is 

20 applied {5, 13) . 

To avoid this problem, one solution is to 
devise a method to limit the substrate for recombination 
to one copy per cell. This should then rule out the 
possibility of having multiple insertion events, through 

2 5 either illegitimate or homologous recombination. To 

accomplish this, the Ac/Ds transposon system is employed 
as an in planta generator of integration substrates (12) . 
This is described in more detail in the following 
section. 

30 In addition to solving the problem of mutiple 

integration events, this approach also broadens the 
application of gene targeting to other plant species. A 
major obstacle in applying gene targeting to 
agronomically important species, such as maize and rice, 
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is the difficulty in generating a large number of 
independent transf ormants (in the order of 1,000). Novel 
use of a transposon-based gene targeting method 
eliminates this difficulty. 

5 

Experimental Approach 

The experimental design utilizes a known 
genomic target in Arabidopsis . The general targeting, 
construct shown in Figure 1 is constructed, using 

10 standard cloning and DNA manipulation methods. As shown 
in Figure 1, the CodA gene is placed upstream of the 
polyl inker region, into which can be inserted the genomic 
sequence for targeting. For DNA excision, short (<1.5 
kb) Ds elements are placed next to the Right Border and 

15 within the 35S-GUS cassette. With this configuration of 
the construct, the frequency of excision can be assayed 
by measuring the loss of GUS activity, since this loss 
results in white sectors upon staining with X-Gluc. The 
Bar gene, flanked by the inserted genomic sequences, 

2 0 provides the positive selection marker for the insertion 

event. After construction of this vector, Arabidopsis 
genomic sequences from TGA3 are inserted into the 
polylinker sites (Fig 1, PLS1 and PLS2) . Transformation 
of Arabidopsis via Agrobacterium is carried out and the 
25 transgenic lines are selected on PPT -containing plates. 

These PPT-resistant plants, but not the wild- 
type, are also sensitive to 5-Fc, due to CodA expression. 
In addition, they have high levels of GUS activity 
(Figure 2, step 1). Ten to twenty transformed lines that 

3 0 show these characteristics are evaluated by Southern blot 

analysis no determine the copy number of the inserted T- 
DNA. Several transformed lines with single copy 
insertions are self -pollinated to produce homozygous 
plants. They are then crossed with another homozygous 
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Arabidopsis line that expresses the stablized Ac 
transposase (12) . This activates the excision of the 
sequences in the original construct that are flanked by 
the two Ds elements (Figure 2, step 2). The efficiency 
5 of excision in the Fl progenies is verified by staining 
the leaves with X-Gluc. Cells that have activated Ds 
transposition are white while the other cells are blue. 
The excised DNA can be reinserted randomly in the genome, 
in which case the CodA gene is retained (Figure 2, 

10 excised insert) . Alternatively, if the insertion occurs 
via homologous recombination at the two homology regions, 
then this marker is lost. The selection scheme entails 
screening the F2 or F3 progenies on medium containing 5- 
Fc and PPT. The surviving plants are then subjected to a 

15 secondary screen to confirm the loss of the GUS marker. 
The predicted genome structure and phenotypes of the 
targeted event are shown at the bottom of Figure 2. 
Plants recovered after these screens are analyzed via PCR 
using diagnostic primers (shown as PI and P2 in Figure 2) 

20 that specifically detect the desired targeted event (7) . 
Southern blots are performed to confirm the proper 
integration of the Bar gene into the TGA3 locus. 
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embodiments described and exemplified above, but is 
capable of variation and modification without departure 
from the scope of the appended claims. 
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What is claimed: 

1. A DNA construct for integration of 
heterologous DNA segments into genomes within cells, the 
5 DNA construct comprising termini having disposed 
therebetween : 

a) a pair of DNA substrates for a selected 
transposase, having disposed therebetween: 

i) a first cloning site and a second 
10 cloning site for insertion of one or more additional DNA 

segments, wherein the first cloning site and the second 
cloning site have disposed therebetween a positive 
selection gene encoding a gene product that confers to 
the cells a selectable phenotype comprising resistance to 
15 a positive selection agent that is deleterious or lethal 
to cells having genomes in which the DNA construct has 
not integrated; and 

ii) a negative selection gene 
disposed between one of the DNA substrates for the 

20 selected transposase and either the first cloning site or 
the second cloning site, but not between the first 
cloning site and the second cloning site, the negative 
selection gene conferring to the cells a selectable 
phenotype comprising susceptibility to a negative 

25 selection agent, to which cells having genomes in which 

the DNA construct has not integrated are not susceptible; 
and, optionally 

b) a detectable marker gene encoding a 
detectable gene product, the detectable marker gene being 

30 operably inserted in the DNA construct relative to one of 
the DNA substrates for the selected transposase such 
that, upon excision of the DNA construct from a genome by 
the action of the transposase, the detectable gene 
product is no longer detectable. 



35 



WO 00/75289 



PCT/USOO/15783 



- 29 - 

2. The DNA construct of claim 1, wherein the 
termini comprise at least one cloning site. 

3. The DNA construct of claim 1, wherein the 
5 termini comprise Agrobacterium tDNA left and right 

borders . 

4. The DNA construct of claim 1, wherein the 
DNA substrate for a selected transposase is selected from 

. 10 the group consisting of: a maize Ds element; a maize dSpm 
element, a maize rdt element, a maize Mn element, a maize 
Tam2 element, a snapdragon Tam4 element and a Drosophila 
P element . 

15 5. The DNA construct of claim 4, wherein the 

DNA substrate for a selected transposase is a maize Ds 
element and the selected transposase is a maize Ac- 
dependent transposase. 

20 6. The DNA construct of claim 1, wherein 

either or both of the first and second cloning sites is a 
polylinker. 

7. The DNA construct of claim 1, wherein the 
25 positive selection gene confers resistance to a selection 

agent selected from the group consisting of antibiotics 
and herbicides. 

8. The DNA construct of claim 7, wherein the 
30 positive selection gene confers resistance to 

phosphinothricin herbicides. 

9. The DNA construct of claim 1, wherein the 
negative selection gene encodes a gene product that 

3 5 converts an innocuous substance to a substance that is 
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deleterious or lethal to the cells. 

10. The DNA construct of claim 9, wherein the 
negative selection gene is a CodA gene. 

5 

11. The DNA construct of claim 1, wherein the 
detectable marker gene encodes a detectable gene product 
selected from the group consisting of: p-glucuronidase , 
(3-galactosidase, chloramphenicol acetyl transferase, 

10 lucif erase, green fluorescent protein, alcohol 
dehydrogenase and a transcription factor. 

12. The DNA construct of claim 11, wherein the 
detectable marker gene encodes (3 -glucuronidase . 

15 

13. The DNA construct of claim 1, wherein the 
termini comprise Agrobacterium tDNA right and left 
borders, the DNA substrate for a selected transposase 
comprises Ds substrates for a maize Ac-dependent 

20 transposase, the positive selection gene encodes 
phosphinothricin acetyltransf erase, the negative 
selection gene encodes cytosine deaminase, the detectable 
marker gene encodes (^-glucuronidase and the first and 
second cloning sites are polylinker sequences. 

25 

14 . The DNA construct of claim 1 wherein one 
or more of the positive selection gene, negative 
selection gene and detectable marker gene is a chimeric 
gene comprising a coding sequence operably linked to one 

30 or more heterologous promoters. 

15. The DNA construct of claim 14, wherein the 
promoter is selected from the group consisting of 
constitutive promoters, inducible promoters and tissue- 

35 specific promoters. 
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16. The DNA construct of claim 14, wherein the 
chimeric gene comprises a plurality of promoters. 

17. The DNA construct of claim 14, wherein the 
5 promoter is a cauliflower mosaic virus 35S promoter. 

18. The DNA construct of claim 1, which 
comprises additional cloning sites disposed between the 
first cloning site and the second cloning site for 

10 insertion of one or more additional DNA segments, the 
additional cloning site being disposed relative to the 
positive selection gene so as not to interfere with the 
conferring of the selectable phenotype. 

15 19. The DNA construct of claim 1, wherein the 

detectable marker gene in its entirety is disposed 
between one of the DNA substrates for a selected 
transposase and the terminus closest thereto. 

20 20. The DNA construct of claim 1, wherein one 

of the DNA substrates for a selected transposase is 
located within the detectable marker gene in a manner 
that does not disrupt operability of the detectable 
marker gene unless the DNA substrate is acted upon by the 

25 selected transposase. 

21. The DNA construct of claim 20, wherein the 
one of the DNA substrates for a selected transposase is 
located between the promoter and the coding sequence of 

30 the detectable marker gene. 

22. The DNA construct of claim 1, operably 
inserted into a vector for transforming a cell. 



35 



23. The DNA construct of claim 22, wherein the 
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cell is a plant cell and the vector is an Agrrcuba cterium 
vector. 

24. The DNA construct of claim 1, adapted for 
5 integrating a heterologous DNA segment at a pre- 
determined location of a genome, wherein the adaptation 
comprises inserting a first targeting segment in the 
first cloning site and a second targeting segment in the 
second cloning site, each targeting segment comprising a 

10 DNA sequence substantially homologous to sequences in the 
genome comprising or flanking the pre -determined 
location, the targeting segments enabling the DNA 
construct to integrate into the genome at the pre- 
determined location by homologous recombination. 

15 

25. A method for inserting a heterologous DNA 
molecule into a pre-determined location on a plant 
genome, which comprises: 

a) transforming a sample of plant cells 

20 containing the genome with the DNA construct 'of claim 24, 
to produce a substrate- transformed cell line; 

b) transforming an equivalent sample of 
plant cells with a gene encoding a transposase that 
specifically acts on the DNA substrates in the DNA 

25 construct of claim 24, to produce a transposase- 
trans formed cell line; 

b) regenerating fertile organisms from 
each of the transformed cell lines; 

c) crossing the substrate-transformed line 
30 with the transposase- transformed line to produce PI 

progeny; 

d) self-pollinating the Fl progeny to 
produce F2 progeny; and 

e) growing the F2 progeny in the presence 
35 of the positive selection agent and the negative 



WO 00/75289 



PCI7US00/15783 



10 



- 33 - 

selection agent, progeny plants comprising the 
heterologous DNA inserted into the pre -determined 
location on the plant's genome being capable of surviving 
in the presence of both the positive selection agent and 
the negative selection agent. 

26. The method of claim 26, which further 
comprises selecting a substrate -transformed cell line 
comprising one copy of the DNA construct per cell. 



27 . A kit for inserting a heterologous DNA 
molecule into a pre-determined location on a plant 
genome, which comprises a container containing the DNA 
construct of claim 24 and instructions for using the DNA 

15 construct to insert a heterologous DNA molecule into a 
pre-determined location on a plant genome. 

28. The kit of claim 27, which further 
comprises a DNA construct having a gene encoding a 

2 0 transposase that specifically acts on the DNA substrates 
in the DNA construct of claim 27. 

29. A method for activation tagging of a plant 
genome to create variants displaying a desired phenotype, 

25 which comprises: 

a) transforming a sample of plant cells 
containing the genome with the DNA construct of claim 1 
or claim 24, to produce a substrate -transformed cell 
line; 

30 b) transforming an equivalent sample of 

plant cells with a gene encoding a transposase that 
specifically acts on the DNA substrates in the DNA 
construct of claim 1, to produce a transposase- 
transformed cell line; 

35 b) regenerating fertile organisms from 
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each of the transformed cell lines; 

c) crossing the substrate -transformed line 
with the transposase- transformed line to produce Fl 
progeny; 

5 d) self-pollinating the Fl progeny to 

produce F2 progeny; and 

e) growing the F2 progeny under conditions 
pre -determined to select for the desired phenotype in the 
plant . 

10 

30. The method of claim 29, wherein in the DNA 
construct, the one of the DNA substrates for a selected 
transposase most proximal to the 3 ' end of the construct 
is located between the promoter and the coding sequence 
15 of the detectable marker gene. 



31- A kit for activation tagging of a plant 
genome to create variants displaying a desired phenotype, 
which comprises the DNA construct of claim 1 or claim 24, 
20 and instructions for using the construct to perform the 
activation tagging. 

32. The kit of claim 32, which further 
comprises a DNA construct having a gene encoding a 
25 transposase that specifically acts on the DNA substrates 
in the DNA construct of claim 31. 
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